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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

{i) APPLICANT: 

(A) NAME: PLANT BIOSCIENCE LIMITED 

(B) STREET: COLNEY LANE 

(C) CITY: NORWICH 

(D) STATE: NORFOLK 

(E) COUNTRY: UK 

(F) POSTAL CODE <ZIP) : NR4 7UH 
(ii) TITLE OF INVENTION: PROTEIN 

(iii) NUMBER OF SEQUENCES: 4 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS -DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.25 (EPO) 



(2) INFORMATION FOR SEQ ID NO : 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1205 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 18 . . 917 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

CCAAATCCCA TCTCAAA ATG AAT GAT CTA TTT TCA GGA TCT TTC TCT CGT SO 
Met Asn Asp Leu Phe Ser Gly Ser Phe Ser Arg 
15 10 

TTC AGA GCT GAC GAT CAA TCG GAC TCT CAC GCC ATA GAA ATG GGA GAC 98 
Phe Arg Ala Asp Asp Gin Ser Asp Ser His Ala lie Glu Met Gly Asp 
15 20 25 

ATT ACT GGC GGA GTC AAT CTC GAC AAA TTC TTC GAA GAT GTT GAA GCC 146 
lie Thr Gly Gly Val Asn Leu Asp Lys Phe Phe Glu Asp Val Glu Ala 
30 35 40 

ATT AAA GAC GAA CTC AAA GGC CTC GAG AAA ATC TAT TCC CAA CTC CAA 194 
lie Lys Asp Glu Leu Lys Gly Leu Glu Lys lie Tyr Ser Gin Leu Gin 
45 50 55 

TCT TCC CAT GAA AAA AGC AAG ACT CTT CAC AAC GCT AAA GCC GTT AAA 242 
Ser Ser His Glu Lys Ser Lys Thr Leu His Asn Ala Lys Ala Val Lys 
60 65 70 75 

GAT CTA AGA TCC AAC ATG GAT AAT GAC GTT TCC ATG GCA TTG AAG AAA 290 
Asp Leu Arg Ser Asn. Met Asp Asn Asp Val- Ser Met Ala Leu Lys Lys 
• 80 85 90 



GCC AAA TTC ATC AAA GTT CGT CTC GAA GCC TTA GAC AGA TCA AAT GCA 



338 
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Ala Lys Phe lie Lys Val Arg Leu Glu Ala Leu Asp Arg Ser Aan Ala 
95 100 105 

GCG AAT CGA AGC CTC CCT GGA TGT GGA CCC GGA AGT TCA TCT GAC AGG 386 
Ala Asn Arg Ser Leu Pro Gly Cys Gly Pro Gly Ser Ser Ser Asp Arg 
110 115 120 

ACG AGA ACT TCA GTT GTG AAC GGA TTA AGG AAG AAA CTT CAA GAG TCA 434 
Thr Arg Thr Ser Val Val Asn Gly Leu Arg Lys Lys Leu Gin Glu Ser 
125 130 135 

ATG AAT CAG TTC AAC GAG CTA AGG CAA AAG ATG GCA TCT GAA TAT AGG 482 
Met Asn Gin Phe Asn Glu Leu Arg Gin Lys Met Ala Ser Glu Tyr Arg 
140 145 150 155 

GAA ACA GTT CAA CGA CGA TAT TAT ACC GTC ACA GGA GAA AAT CCT GAT 530 
Glu Thr Val Gin Arg Arg Tyr Tyr Thr Val Thr Gly Glu Asn Pro Asp 
160 165 170 

GAA GCA. GTT CTT GAT ACA CTC ATA TCT ACA GGT CAA AGT GAG ACG TTC 578 
Glu Ala Val Leu Asp Thr Leu lie Ser Thr Gly Gin Ser Glu Thr Phe 
175 180 185 

TTG CAA AAG GCA ATT CAA GAG CAA GGG AGA GGA CAA GTG ATG GAT ACA 626 
Leu Gin Lys Ala lie Gin Glu Gin Gly Arg Gly Gin Val Met Asp Thr 
190 195 200 

GTT ATG GAA ATT CAA GAA AGG CAT GAA GCT GTG AAG GAA TTG GAG AGG 674 
Val Met Glu He Gin Glu Arg His Glu Ala Val Lys Glu Leu Glu Arg 
205 210 215 

AAT TTG AAA GAA TTG CAT CAA GTA TTC TTG GAC ATG GCT GTT TTG GTT 722 
Asn Leu Lys Glu Leu His Gin Val Phe Leu Asp Met Ala Val Leu Val 
220 225 230 235 

GAA AGT CAA GGA GCT CAA CTT GAT GAT ATT GAG AGC CAA GTG AAT AGG 770 
Glu Ser Gin Gly Ala Gin Leu Asp Asp lie Glu Ser Gin Val Asn Arg 
240 245 250 

. GCT AAT TCC TTC GTT AGA GGG GGT GCT CAG CAA CTG CAA GTG GCA AGG 818 
Ala Asn Ser Phe Val Arg Gly Gly Ala Gin Gin Leu Gin Val Ala Arg 
255 260 265 

AAG CAC CAG AAG AAC ACT AGA AAA TGG ACT TGT TTT GCT ATT ATT CTT 866 
Lys His Gin Lys Asn Thr Arg Lys Trp Thr Cys Phe Ala He He Leu 
270 275 280 

CTG CTT ATC ATC ATT TTG GTG GTG GTT CTT TCT ATT CAG CCA TGG AAA 914 
Leu Leu He He He Leu Val Val Val Leu Ser He Gin Pro Trp Lys 
285 290 295 

AAA TGAGAATTTG TCTATGGTCA AAGGTCTTCT GGTGGACCCC TTCAATGTTT 967 

Lys 

300 

TGAATATTCT AAATTTTTAT ATTTTATTAT TTTAGCCATG CTTATTATTT TGTGTTATTT 1027 

TGGATTTTTT TTTTGTTTTT AATGTGGGGA AGAGTAAACT GGATGGGGGT CCATGTGCTA 1087 

TTTAGAGAAA TACTTGGGAG TTCTCTTTTT GTAATTATTG CTGTATTTAG AGTATAATTC 1147 

TTTTTCTATA TTGTTGGCAG GTTAATTTGT TTGTTTGATT ATATTCTCAT TTAGATTT 1205 



(2) INFORMATION FOR SEQ ID NO: 2: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 300 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2x 

Met Asn Asp Leu Phe Ser Gly Ser Phe Ser Arg Phe Arg Ala Asp Asp 
15 10 15 

Gin Ser Asp Ser His Ala lie Glu Met Gly Asp He Thr Gly Gly Val 
20 25 30 



Asn Leu Asp Lys Phe Phe Glu Asp Val Glu Ala He Lys Asp Glu Leu 
35 40 45 

Lys Gly Leu Glu Lys He Tyr Ser Gin Leu Gin Ser Ser His Glu Lys 
50 55 60 

Ser Lys Thr Leu His Asn Ala Lys Ala Val Lys Asp Leu Arg Ser Asn 
65 70 75 80 

Met Asp Asn Asp Val Ser Met Ala Leu Lys Lys Ala Lys Phe He Lys 
85 90 95 

Val Arg Leu Glu Ala Leu Asp Arg Ser Asn Ala Ala Asn Arg Ser Leu 
100 105 110 

Pro Gly Cys Gly Pro Gly Ser Ser Ser Asp Arg Thr Arg Thr Ser Val 
115 120 125 

Val Asn Gly Leu ^g Lys Lys Leu Gin Glu Ser Met Asn Gin Phe Asn 
130 135 140 

Glu Leu Arg Gin Lys Met Ala Ser Glu Tyr Arg Glu Thr Val Gin Arg 
145 ISO 155 160 

Arg Tyr Tyr Thr Val Thr Gly Glu Asn Pro Asp Glu Ala Val Leu Asp 
165 170 175 

Thr Leu He Ser Thr Gly Gin Ser Glu Thr Phe Leu Gin Lys Ala He 
180 185 190 

Gin Glu Gin Gly Arg Gly Gin Val Met Asp Thr Val Met Glu He Gin 
195 200 205 

Glu Arg His Glu Ala Val Lys Glu Leu Glu Arg Asn Leu Lys Glu Leu 
210 215 220 

His Gin Val Phe Leu Asp Met Ala Val Leu Val Glu Ser Gin Gly Ala 
225 230 235 240 

Gin Leu Asp Asp He Glu Ser Gin Val Asn Arg Ala Asn Ser Phe Val 
245 250 255 



Arg Gly Gly Ala Gin Gin Leu Gin Val Ala Arg Lys His Gin Lys Asn 
260 265 270 



Thr Arg Lys Trp Thr Cys Phe Ala He He Leu Leu Leu He He He 
275 280 285 



Leu Val Val Val Leu Ser He Gin Pro Trp Lys Lys 
290 295 300 



(2) INFORMATION FOR SEQ ID NO: 3: 
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<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1334 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 77.. 991 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GAATTCCTCG AGCTACGTCA GGGATTCATT CCGATCTGAA ATCTCTCTCT AGATTTCTCT 60 

ATTTTTCGAA TTTTAA ATG AAC GAT TTG TTT TCC AGC TCA TTC TCT CGC 109 
Met Asn Asp Leu Phe Ser Ser Ser Phe Ser Arg 
15 10 

TTC CGC AGC GGA GAA CCA TCC CCT CGC CGA GAC GTT GCC GGC GGT GGC 157 
Phe Arg Ser Gly Glu Pro Ser Pro Arg Arg Asp Val Ala Gly Gly Gly 
15 20 25 

GAC GGA GTT CAG ATG GCG AAT CCC GCG GGA TCA ACC GGT GGT GTG AAC 205 
Asp Gly Val Gin Met Ala Asn Pro Ala Gly Ser Thr Gly Gly Val Asn 
30 35 40 

CTC GAC AAG TTC TTC GAA GAT GTT GAA TCT GTG AAA GAA GAG CTA AAG 253 
Leu Asp Lys Phe Phe Glu Asp Val Glu Ser Val Lys Glu Glu Leu Lys 
45 50 55 

GAG CTA GAT CGG CTC AAC GAA ACA CTC TCT TCA TGT CAC GAG CAG AGC 301 
Glu Leu Asp Arg Leu Asn Glu Thr Leu Ser Ser Cys His Glu Gin Ser 
60 65 70 75 

AAG ACG CTT CAC AAT GCT AAA GCC GTT AAA GAT CTC CGG TCT AAA ATG 349 
Lys Thr Leu His Asn Ala Lys Ala Val Lys Asp Leu Arg Ser Lys Met 
80 85 90 

GAC GGT GAC GTT GGA GTC GCG TTG AAG AAG GCG AAG ATG ATT AAA GTT 397 
Asp Gly Asp Val Gly Val Ala Leu Lys Lys Ala Lys Met lie Lys Val 
95 100 105 

AAA CTC GAG GCG CTA GAT CGT GCC AAT GCT GCT AAT CGG AGT CTC CCT 44 5 

Lys Leu Glu Ala Leu Asp Arg Ala Asn Ala Ala Asn Arg Ser Leu Pro 
110 115 120 

GGC TGT GGA CCT GGT TCT TCC TCC GAT CGA ACC AGG ACC TCT GTC CTC 4 93 

Gly Cys Gly Pro Gly Ser Ser Ser Asp Arg Thr Arg Thr Ser Val Leu 
125 130 135 

AAT GGT CTC AGG AAG AAA TTG ATG GAC TCT ATG GAT AGT TTC AAC CGA 541 
Asn Gly Leu Arg Lys Lys Leu Met Asp Ser Met Asp Ser Phe Asn Arg 
140 145 150 155 

TTG AGG GAG CTT ATC TCG TCC GAG TAT AGA GAA ACT GTA CAG AGG AGG 589 
Leu Arg Glu Leu lie Ser Ser Glu Tyr Arg Glu Thr Val Gin Arg Arg 
160 165 ' 170 

TAC TTC ACC GTC ACC GGC GAG AAT CCG .GAT GAA: CGA ACC CTA GAT CGA 637 
Tyr Phe Thr Val Thr Gly Glu Asn Pro Asp Glu Arg Thr Leu Asp Arg 
175 180 185 

CTG ATT TCC ACT GGA GAG AGT GAG AGA TTC TTG CAG AAA GCA ATA CAA 685 
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Leu lie Ser Thr Gly Glu Ser Glu Arg Phe Leu Gin Lys Ala lie Gin 
190 195 200 

GAA CAA GGA AGA GGA AGG GTG TTA GAC ACC ATT AAC GAG ATT CAA GAA 733 
Glu Gin Gly Arg Gly Arg Val Leu Asp Thr lie Asn Glu lie Gin Glu 
205 210 215 

AGG CAT GAT GCG GTT AAA GAC ATT GAG AAG AAT CTC AGG GAG CTT CAC 781 
Arg His Asp Ala Val Lys Asp lie Glu Lys Asn Leu Arg Glu Leu His 
220 225 230 235 

CAG GTG TTT CTA GAC ATG GCC GTG CTG GTA GAG CAC CAG GGA GCT CAG 829 
Gin Val Phe Leu Asp Met Ala Val Leu Val Glu His Gin Gly Ala Gin 
240 245 250 

CTT GAT GAC ATC GAG ACT CAT GTG GGT CGA GCT AGC TCC TTT ATC AGA 877 
Leu Asp Asp lie Glu Ser His Val Gly Arg Ala Ser Ser Phe lie Arg 
255 260 265 

GGC GGA ACT GAC CAG CTA CAA ACC GCT CGG GTT TAC CAG AAG AAC ACG 925 
Gly Gly Thr Asp Gin Leu Gin Thr Ala Arg Val Tyr Gin Lys Asn Thr 
270 275 280 

CGA AAA TGG ACA TGT ATT GCC ATT ATT ATT CTC ATC ATC ATC ATA ACT 973 
Arg Lys Trp Thr Cys lie Ala He He He Leu He He He He Thr 
285 290 295 

GTT GTG GTT CTT GCT GTT TTAAAACCGT GGAACAACAG CAGTGGCGGC 1021 
Val Val Val Leu Ala Val 
300 305 

GGCGGCGGTG GTGGTGGTGG GGGTACCACT GGAGGAAGTC AACCAAATTC AGGGACACCA 1081 

CCAAATCCTC CTCAGGCAAG GCGTCTATTG CGTTGAAGTT GAAGTTGAAG TTGAGTTTCG 1141 

TTATTTGCAT ATATATTCTT TCTTTGAAAA ACCTTATTAT CAAACCAGCT TTGTGTTACT 1201 

ACTTTCTACT GCTGGTTTGT TGTTAATCTC CCGTTTATTT GGTTTTTGTG AAAGAATTTA 1261 

AAATGTGGGT TAGATGAGAA AATTAGTACA ACATTCTCTT GTATCTATGT TTGCTACCCT 1321 

GACGTAGCTC GAG 1334 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 05 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Asn Asp Leu Phe Ser Ser Ser Phe Ser Arg Phe Arg Ser Gly Glu 
1 5 10 15 

Pro Ser Pro Arg Arg Asp Val Ala Gly Gly Gly Asp Gly Val Gin Met 
20 25 130 

Ala Asn Pro Ala Gly Ser Thr Gly Gly Val Asn Leu Asp Lys Phe Phe 
35 40 45 



Glu Asp Val Glu Ser Val Lys Glu Glu Leu Lys Glu Leu Asp Arg Leu 
50 55 60 
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Asn Glu Thr Leu Ser Ser Cys His Glu Gin Ser Lys Thr Leu His Asn 
65 70 75 80 



Ala Lys Ala Val Lys Asp Leu Arg Ser Lys Met Asp Gly Asp Val Gly 



85 



90 



95 



Val Ala Leu Lys Lys Ala Lys Met lie Lys Val Lys Leu Glu Ala Leu 
100 105 110 

Asp Arg Ala Asn Ala Ala Asn Arg Ser Leu Pro Gly Cys Gly Pro Gly 
115 120 125 

Ser Ser Ser Asp Arg Thr Arg Thr Ser Val Leu Asn Gly Leu Arg Lys 
130 135 140 

Lys Leu Met Asp Ser Met Asp Ser Phe Asn Arg Leu Arg Glu Leu lie 
145 150 155 160 

Ser Ser Glu Tyr Arg Glu Thr Val Gin Arg Arg Tyr Phe Thr Val Thr 
165 170 175 

Gly Glu Asn Pro Asp Glu Arg Thr Leu Asp Arg Leu He Ser Thr Gly 
180 185 190 

Glu Ser Glu Arg Phe Leu Gin Lys Ala He Gin Glu Gin Gly Arg Gly 
195 200 205 

Arg Val Leu Asp Thr He Asn Glu He Gin Glu Arg His Asp Ala Val 
210 215 220 

^■. 

Lys Asp He Glu Lys Asn Leu Arg Glu Leu His Gin Val Phe Leu Asp 
225 230 235 240 

Met Ala Val Leu Val Glu His Gin Gly Ala Gin Leu Asp Asp He Glu 
245 250 255 

Ser His Val Gly Arg Ala Ser Ser Phe He Arg Gly Gly Thr Asp Gin 
260 265 270 

Leu Gin Thr Ala Arg Val Tyr Gin Lys Asn Thr Arg Lys Trp Thr Cys 
275 280 285 



He Ala He He He Leu He He He He Thr Val Val Val Leu Ala 
290 295 300 



Val 
305 
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SEQ. ID. No. 5 

TTTAGATTTACTCTTATATTAGTTTGTTTGTTTAATTGGACGGTTGTTATATCTTTTTCTTA 
ATATGAGATTTATGTCGTTATTAATGTTTTTCTCTTGAGGGTTCATAAAGAGATTTATCGTG 
TACCTGGGGGTAGGTCAAATGAGAAGGGGTGTAATTTTTGTTTTTTTTTTAGGTTTTATTGT 
GTTTTATTATTCGTACCGATTTTATTATTTTATATTTTTAAATCTTATAAGTTTTGTAACTT 
CCCCAGGTGGTCTTCTGGAAACTGGTATCTGTTTAAGAGTAAAAAAGGTACCGACTTATCTT 
TCTTGGTGGTGGTTTTACTACTATTCGTCTTCTTATTATCGTTTTGTTCAGGTAAAAGATCA 
CAAGAAGACCACGAAGGAACGGTGAACGTCAACGACTCGTGGGGGAGATTGCTTCCTTAATC 
GGGATAAGTGAACCGAGAGTTATAGTAGTTCAACTCGAGGAACTGAAAGTTGGTTTTGTCGG 
TACAGGTTCTTATGAACTACGTTAAGAAAGTTTAAGGAGAGGTTAAGGAAGTGTCGAAGTAC 
GGAAAGAACTTAAAGGTATTGACATAGGTAGTGAACAGGAGAGGGAACGAGAACTTAACGGA 
AAACGTTCTTGCAGAGTGAAACTGGACATCTATACTCACATAGTTCTTGACGAAGTAGTCCT 
AAAAGAGGACACTGCCATATTATAGCAGCAACTTGACAAAGGGATATAAGTCTACGGTAGAA 
AACGGAATCGAGCAACTTGACTAAGTAACTGAGAACTTCAAAGAAGGAATTAGGCAAGTGTT 
GACTTCAAGAGCAGQACAGTCTACTTGAAGGCCCAGGTGTAGGTCCCTCCGAAGCTAAGCGA 
CGTAAACTAGACAGATTCCGAAGCTCTGCTTGAAACTACTTAAACCGAAAGAAGTTACGGTA 
CCTTTGCAGTAATAGGTACAACCTAGAATCTAGAAATTGCCGAAATCGCAACACTTCTCAGA 
ACGAAAAAAGTACCCTTCTAACCTCAACCCTTATCTAAAAGAGCTCCGGAAACTCAAGCAGA 
AATTACCGAAGTTGTAGAAGCTTCTTAAACAGCTCTAACTGAGGCGGTCATTACAGAGGGTA 
AAGATACCGCACTCTCAGGCTAACTAGCAGTCGAGACTTTGCTCTCTTTCTAGGACTTTTAT 
CTAGTAAGTAAAACTCTACCCTAAACC 



